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Modern mathematics tends to obliterate history: each new school 
rewrites the foundations of its subject in its own language, which 
makes for fine logic but poor pedagogy. 

R. Hartshorne 

1 Introduction 

For better or for worse, mathematics tends to be taught and studied ahistori- 
cally. We tend to present theorems in an order that makes sense after the fact, 
valuing brevity over exposition, conciseness over connection, and a certain kind 
of logical purity at the expense of motivation. 

One case in point is the typical treatment of compactness. Compactness has 
come to be one of the most important and useful notions in advanced mathe- 
matics. However, most textbooks do little to elucidate where the notion came 
from, how the definition developed, or what deep and important questions the 
idea of compactness attempted to answer. 

The treatment of compactness in Rudin is typical. The main findings of 
compactness are presented as isolated facts, such as: 

Theorem 1.1 (Weierstrass) Every hounded infinite subset of R'^ has a limit 
point in R^ p. 40]. 

Rudin does not help students make the connection between this theorem and 
the fact that a continuous function achieves a maximum on a closed, bounded 
interval (which is stated two chapters later). Nor does Rudin explain that the 
continuous function theorem, originally proved by Weierstrass, was one of the 
main motivations for the notion of compactness. In fact, the theorem is stated 
as an application for compactness rather than a motivation. 

*This paper is a condensed, revised version of my masters thesis I39| . The research for 
this project was carried out at UC Berkeley, Umea University, and Columbia University. 
I am grateful to the many mathematicians and librarians at these universities, including 
Hendrik Lenstra and Hans Wallin, who patiently answered my many questions as well as two 
anonymous reviewers who gave me valuable feedback on an earlier version. 



The goal of this paper is to fill in some of the gaps left out in a standard 
treatment of compactness. In the first section, we discuss the original motiva- 
tions for the idea. In the second section, we trace the development of what have 
become the two main characterizations of compactness, in terms of sequences 
and open covers. In the third section, we discuss how sequential compactness 
can be generalized, using nets and filters, to make it equivalent to open cover 
compactnes^ 

1.1 Concept Map 

We begin with a map of the history covered in this paper. This map, on the 
following page and roughly chronological from top to bottom, lists the major 
landmarks in the development of topology and other related fields and their 
influences on each other. 

1.2 Terminology 

We know now that there are many notions related to (but not necessarily equiv- 
alent to) compactness. Table [l] contains a list of some of these notions. 

We also know now how these compactness notions are related. For instance, 
compactness implies countable compactness implies limit point compactness. 
Sequential compactness implies countable compactness. And if we put further 
restrictions on our spaces we can get implications in the other direction. In Ti 
spaces, limit-point compactness implies countable compactness. In first count- 
able spaces, countable compactness implies sequential compactness. In second 
countable spaces, sequential compactness implies compactness. In particular, 
we know that in metric spaces, which turn out to be second countable, the first 
four notions of compactness in Table [T] are equivalent. 

However, it took some time as compactness was applied to different types 
of spaces for relationships like these to be worked out. And to make things 
even more difficult, the names for these notions did not stabilize for some time. 
Table [2] lists different terms used for compactness-related ideas used by some 
of the most influential mathematicians in the historical development. In this 
paper I will use the modern names listed in Table [2] and will focus mostly on 
open-cover and sequential compactness. 

2 Possible motivations for compactness 

Compactness grew out of one of the most productive periods of mathematical 
activity. In middle to late nineteenth century Europe, advanced mathematics 
began to take the form which we know today. In the background was Cantor's 

^The third section, which includes a discussion of nets and filters, is more demanding 
technically than the other two sections. These topics are not always included at the under- 
graduate level, although one popular textbook [36) has recently included the topic of nets as 
a supplement to the chapter on compactness. 
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Figure 1: Concept map. 
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[open-cover] compact: Every open cover has a finite subcover. 
(also called the Borel-Lebesgue property) 

sequentially compact: Every sequence has a convergent subse- 
quence. 

countably compact: Every countable open cover has a finite sub- 
cover. 

limit-point compact: Every infinite subset of X has a limit point 

in X. 

(also called Frechet compact or the Bolzano- Weierstrass property) 
relatively compact: The closure is compact. 

quasi- compact: Compact to Bourbaki. 

pseudo-compact: Each continuous real valued function on X is 
bounded. 

finally compact: index of compactness is Kq. 
(also called Lindclof compact) 

Table 1: Flavors of compactness 



Who 


When 


Their term 


Modern term 


Frechet 


1906 


compact 


relatively 

compact 


extremal 


sequentially 
compact 


Russian School 
(Alexandroff, etc.) 


1920's 


bicompact 


compact 


compact 


countably 

compact 


Bourbaki 


1920's 


quasi-compact 


compact 


compact 


compact and 
Hausdorff 



Table 2: Names of compactness-related terms 
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work establishing the beginning of a systematic study of set theory and point-set 
topology [S3]. Also, many mathematicians — including Cantor, Weierstrass, and 
Dedekind — were worried about the foundations of mathematics and began to 
make rigorous many of the ideas that had for centuries been taken for grantetQ 
While some of the nineteenth century work can be traced to mathematical 
concerns of the early Greeks, the level of rigor and abstraction provided a sort 
of revolution of mathematical thought. 

It is in this context that we will discuss some specific problems that appear 
to have motivated the concept of compactness. In particular, we will discuss the 
influence of the study of properties of closed bounded intervals of real numbers 
(which I will denote [a, 6]), spaces of continuous functions, and solutions to 
differential equations. 



2.1 Properties of [a,b] 

In mid and late nineteenth century, mathematicians worked out what we now 
know as properties of the real line. There were two camps that ended up in- 
fluencing the notion of compactness. One was that of Bolzano and Weierstrass 
who, roughly half a century apart, studied functions defined on sequences of 
real numbers. The other camp included mathematicians like Heine, Borel, and 
Lebesgue, who looked at topological features, such as the covering of sets by 
open neighborhoods. We will examine both of these camps in more detail. 

One of the central questions involved in the study of behavior of sequences 
concerned the behavior of continuous functions defined on closed, bounded in- 
tervals of the real line. Weierstrass proved rigorously in 1877[^ the following 
theorem, corresponding to Figure 2: 

Theorem 2.1 (Weierstrass) Each function continuous in a limited [equiva- 
lent to modern-day "closed and bounded"] interval attains there at least once its 
maximum \44\ P- 244]- 



Frechet, who first defined compactness, did so in a paper entitled Generalisation 
d'un theoreme de Weierstrass, in which he explicitly tries to generalize Theorem 



2.1 to abstract topological spaces 18,, p. 

To prove his theorem, Weierstrass built on the work of Bolzano, who had 
proved back in 1817 the following: 

Lemma 2.2 (Bolzano) // a property M does not apply to all values of a vari- 
able quantity x, but to all those that are smaller than a certain u, there is always 



■^There was some work on foundational questions as early as the 18th century, which arose 
in response to an attack on mathematical foundations by philosopher Bishop Berkeley and 
an increased need for mathematicians to teach. However, many 18th century mathematicians 
produced results without a concern for rigorous foundations. See 1211 p. 23-8] for more details. 

^This date refers to one of the earliest publications of the theorem, however it is likely 
that Weierstrass actually proved it years before. As noted in the introduction, Weierstrass 
was very concerned about rigor so was reluctant to publish his proofs. However, he was very 
active and influential, so his ideas were disseminated orally. 
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Figure 2: A continuous function on [a,b]. 

a quantity U which is the greatest of those of which it can be asserted that all 
smaller x possess the property M f^flt p. 150]. 



< ) ► 

U 

< M ►! 

Figure 3: Illustration of Bolzano's lemma. 

This lemma, known today as the greatest lower bound property for real 
numbers (any subset of the real line which is bounded below has a greatest 
lower bound [38l p. 426]), was used by Bolzano to prove the Intermediate Value 
Theorem: if / is continuous on [a, h] with /(a) < and f{h) > , then for some 
X between a and 5, f{x) will be exactly 

In particular, Bolzano's lemma allowed Weierstrass to prove that every 
bounded infinite set of real numbers has a limit point. And it is this prop- 
erty that Frechet used when he generalized Weierstrass 's theorem to abstract 
spaces. We now know this property as the Bolzano-Weierstrass property^ or 
limit-point compactness. 

While Bolzano and Weierstrass were trying to characterize properties of 
the real line in terms of sequences, other mathematicians, such as Borel and 
Lebesgue were trying to characterize it in terms of open covers. Borel proved 
the following in his 1894 thesis: 

*There is an error in the proof of the Intermediate Value Theorem, namely that to prove 
a limit exists, Bolzano assumes a limit to exist. Nonetheless, most of the ideas in this paper 
are correct, and the paper quite sophisticated for its time. See [29j for an insightful analysis. 
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Theorem 2.3 (Borel) // there is an infinite number of partial intervals on a 
line segment, such that every point on the line is in at least one of the intervals, 
then one can effectively find a finite number of intervals from the given intervals 
having the same property ( any point on the line is in at least one of them.) 
p. 43]. 

It turns out that Borel's approach was similar to the approach Heine used 
to prove in 1872 that a continuous function on a closed interval was uniformly 
continuous. Interestingly, Heine doesn't appear to have recognized how his 
method of proof could be generalized to prove the covering theorem |46l p. 
108]. Moreover Heine's theorem was first proven by Dirichlet in his lectures of 
1852, with a more explicit use of coverings and subcoverings than in Heine's 
theorem \W, p. 91]. However Dirichlet's notes were not published in 1904, 
which might explain why he does not get credit for the generalized version of 
the Borel theorem. The reason that Heine's name is attached to the theorem is 
that Schonflies, a student of Weierstrass, noticed the connection between Heine's 
work and Borel's. The generalized theorem, which is now commonly called the 
Heine-Borel theorem, with modern notation, is: 

Theorem 2.4 (Heine-Borel Theorem) A subset of i?" is compact iff it is 
closed and bounded \41\ p- 39]. 

While Heine is credited with a theorem he did not prove, it appears that 
Cousin was largely overlooked for a theorem he did prove. In 1895, he general- 
ized the Borel Theorem for arbitrary covers: 

Theorem 2.5 (Cousin's Theorem) If to each point of a closed region [equiv- 
alent to modern-day "closed and bounded"] there corresponds a circle of finite 
radius ["neighborhood"], then the region can be divided into a finite number of 
subregions such that each subregion is interior to a circle of a given set having 
its center in the subregion [26, p. 29^ 

Cousin's theorem is generally attributed to Lebesgue, who was said to be 
aware of the result in 1898 and published his proof in 1902 p. 29]. 

While there is some debate over who was really responsible for the ideas and 
proofs, it was well known in Western mathematics right about the time that 
Frechet first defined compactness that any closed bounded subset of i?" has the 
open-cover property (sometimes called the Borel-Lebesgue property). And, as 
we will see, after some deliberation, this open-cover property became accepted 
as the definition for compactness. 

2.2 Spaces of continuous functions 

A second motivation for the notion of compactness was the study of abstract 
topological spaces like spaces of continuous functions, C°. In C", points are 

^From n^, p. 22]. 
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functions (whereas in [a,b] points are real numbers). The properties of [a,b] 
alone might not have been seen as important to generalize if it wasn't the case 
that the properties seemed to be important in more abstract spaces as well. 

However it turned out that infinite dimensional spaces like C° were not as 
well-behaved as finite dimensional spaces like i?" . For instance, closed, bounded 
subsets of continuous functions on i?" do not necessarily have the Bolzano- 
Weierstrass or open-cover property. The work in this area was done by Ascoli 
and Arzela in the last decades of the 1800's. 

The following example [28l p. 238] illustrates that a closed, bounded subset 
of continuous functions on i?" is not, in our modern language, sequentially 
compact. 

Consider B, the set of continuous functions, /, defined on [0, 1] with ||/j| < 1. 
(This is the closed unit ball in C°[0, 1] and 11 II is the sup norm.) We will show 
that there is a sequence in B that does not have a convergent subsequence. Let 
fn{x) — x". It lies in B. But we cannot find a subsequence that converges to a 
function in Suppose / is such a function. Then 



Since / is a discontinuous function, it is not in C'^. Hence the sequence fn{x) 
has no convergent subsequence. 

The problem in this example comes from how the functions converge. If 
convergence means pointwise convergence, we do not get behavior analogous 
to that of, say, sequences in closed unit balls of i?". In order to avoid this 
problem, Ascoli introduced the notion of equicontinuity (Bourbaki [2], Book 
10). Equicontinuity requires functions to converge to a limit all at once instead 
of pointwise. 

Definition 2.6 A set E is equicontinuous iff for all e > there exists an S > 
such that \s — t\ < S and f € E imply |/(s) — f(t)\ < e. 

The Arzela-Ascoli theorem then states: 

Theorem 2.7 (Arzela-Ascoli) Any bounded equicontinuous sequence of func- 
tions in C'^[a,b] has a uniformly convergent subsequence '35, p. 279]. 

And using modern terminology we can state a consequence of this theorem, 
analogous to the Heine-Borel theorem: 

Theorem 2.8 A subset of C'^[a,b] is compact iff it is closed, bounded and 
equicontinuous fSSl p. 279]. 



f{x) = lim /„,(x) 



which would imply (by pointwise convergence) that 
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Ascoli proved the sufficiency of this condition in 1883 and Arzela the neces- 
sity in 1893 [T71 p. 392]. This generalization of Bolzano- Weierstrass (although 
not stated in terms of compactness) was apparently well known after 1880. 
Moreover, Hilbert seems to have discovered this property independently and 
pubhshed it in 1900 [H p. 82]. 

It is unclear whether Arzela and Ascoli themselves were aware of how their 
work was connected with compactness. But it is clear that Frechet's work was 
influenced by theirs [Ml p. 255]. 



2.3 Solutions to differential equations 

A third motivation for the notion of compactness came from the desire to find 
solutions to differential equations. Peano, a contemporary of Arzela and Ascoli 
as well as a fellow Italian, realized that the Arzela-Ascoli theorem might be 
useful for demonstrating the existence of such solutions. He searched for solu- 
tions by making a sequence of approximations. He then used what we now call 
compactness to show that there will be a subsequence that converges to a limit 
(which will be the solution to the differential equation). 
Peano proved the following theorem in 1890: 

Theorem 2.9 (Peano) Let f{x, y) be bounded and continuous on a domain 
G. Then for each interior point [xq, j/q) of G, at least one integral curve of the 
differential equation 

-v- = /(a;,y) 
ay 

passes through that point p. 29]. 

While it is not clear if Frechet was aware of this application, it is important to 
note that there already seemed to be applications for the notion of compactness 
before it was formally defined. 

It is also interesting to think about what conditions must be satisfied in 
order for the conclusion of a theorem to become a definition. One reasonable 
requirement seems to be that the notion should have practical use. And we have 
seen now three uses for compactness: on the real line, in spaces of continuous 
functions, and for showing the existence of solutions to differential equations. In 
other words, we see that compactness notions already seemed to be important in 
the fields we now call real analysis, topology, functional analysis, and differential 
equations. So it seems the time was ripe for someone to propose to give a name 
to and formally define this notion, as we will see in the next section. 



3 Developing the definition 

We will trace below the development of the two central notions of compactness 
discussed above, those stemming from sequences and open covers of real num- 
bers. Again, it is useful to know something about the climate of the mathematics 
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community at the time of these historical developments. We will focus on the 
contributions of only a few main characters, but there was a large supporting 
cast of mathematicians who were developing ideas that are now the founda- 
tions for analysis and topology. There were many groups of mathematicians in 
very close contact with each other, so much so that it is difhcult to tease apart 
their contributions. Among them, in France, were Hadamard, Lebesgue, and 
Frechet. In Russia, Alexandroff and Urysohn. In Germany, HausdorfF, Hilbert, 
Schonflies, and Cantor. In Hungary, Riesz. In the U.S., Chittenden, Hedrick, 
and Moore. 

We will start with the work of Frechet, who coined the term "compact" and 
gave definitions for what we now know as countable and sequential compact- 
ness. We will then briefly discuss contributions by Alexandroff and Urysohn 
who developed and stated what we now call compactness. We will show why 
these notions of compactness are not equivalent, providing motivation for a 
generalization of sequential compactness in abstract topological spaces. 

3.1 Frechet: Countable and limit-point compactness 

While Frechet was influenced by many contemporaries and predecessors, it 
seems he deserves credit as the father of compactness. It was Frechet who first 
recognized that the notion deserved a name. Frechet also defined metric spaces 
for the first time and made inroads into functional analysis, thus providing a 
context for which the importance of compactness became clear. 

Frechet was a mathematician of big ideas. He preferred definitions that had 
an intuitive feel rather than analytic power. This preference can be seen in his 
1904 paper where he defined compactness and developed some of the ideas that 
would end up in his 1906 thesis. He says that he captured the intuitive idea of 
what we now call countable compactness by using nested intersections. 

Definition 3.1 A set E is called compact if, whenever {En} is a sequence of 
nonempty, closed subsets of E such that En+i is a subset of En for each n, there 
is at least one element that belongs to all of the En 144\ V- ^44] (translated 
with modern language). 

The exact nature of Frechet's intuition for this definition is unclear, but here 
is a guess. One thing the nested intersection definition allows us to quickly see 
is that sets that have "holes" are not compact. For instance, we can see that 
X — [a, b] is compact and Y — [a, b) U (6, c] is not. 

In the latter case, consider Ei — [a^, 6) U (, 6, Ci] where a^+i > Oi and Oi ^ b 
and Ci ^ b . These sets are clearly nested and are closed in Y, but the infinite 
intersection of those intervals is empty. Hence Y is not compact. 

The nested intersection property also allows us to easily rule out sets that 
have tails running to oo. For instance, we can show that R is not compact. 

^From the original French: Enfin nous appellerons itensemble compact tout ensemble E tel 
qu'il existe toujours au moins un element commun a unc suite infinie quelconque d'ensembles 
El, E2, ... E„, contenus dans E, lorsque ceux-ci (possedant au moins un element chacun) 
sont fermes et chacun contenu dans le precedent [181 p. 849]. 
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Figure 4: Nested intersections. 



-4 E [ E i ► 

n n + 1 n + 2 n + 3 . . 



Figure 5: Nested tails. 

Let En = [n,oo). Each En is closed because it contains its limit points, and 
clearly En+i C En- But the infinite intersection of these intervals is empty, so 
R is not compact. 

While Frechet preferred his intuitive definition involving nested intersec- 
tions [38l p. 430], he realized the importance of also providing a more useful, 
if less intuitive, definition. He claimed that the following definition, which uses 
the Bolzano- Weierstrass property, is much more useful: 

Definition 3.2 A set E in a space C is [relatively limit-point] compact if and 
only if every infinite subset of E has at least one limit element in C (but not 
necessarily in E]^ 144[ P- 244]- 

In Frechet's 1906 thesis, this characterization is given as the definition, while 
the definition from his 1904 paper, in terms of nested intersections, is given as 
a theorem. 

When comparing these two definitions, it is good to keep in mind that 
Frechet's work came before the study of abstract topological spaces was clearly 
worked out. (Hausdorff 's important book on the subject was not published until 
1914.) So while Frechet was trying to state compactness for abstract topolog- 
ical spaces, he was basically working within metric spaces where limit-point, 
countable (and sequential) compactness are equivalent. 

Another interesting detail in the history of compactness concerns why Frechet 
chose the name "compact." When Frechet first introduced the term, some math- 
ematicians did not like his choice. For instance, Schonflies suggested that what 
Frechet called compact be called something like "liickenlos" (without gaps) or 
"abschhessbar" (closable) dH p. 266]. One can see here, too, a sign that basic 

^From the original French: Nous diron qu'un ensemble est iteompact lorsqu'il ne comprend 
qu'un nombre fini d'elements ou lorsque toute infinite de ses elements donne lieu a au moins 
un element limite [191 p. 6]. 
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notions of topology were not yet worked out. For instance the term "liickenlos" 
seems more appropriate for the notion of completeness. 

However, despite all of Frechet's early concern with intuitive definitions and 
choice of terminology, it is surprising that at the end of his life, he could not 
remember why he chose the term: 

"... j'ai voulu sans doute eviter qu'on puisse appeler compact un 
noyau solide dense qui n'est agremente que d'un fil allant jusqu'a 
Tinfini. Cost unc supposition car j'ai completement oublie les raisons 
de mon choix!" [35J p. 440] 

[Doubtless I wanted to avoid a solid dense core with a single thread 
going off to infinity being called compact. This is a hypothesis be- 
cause I have completely forgotten the reasons for my choice!] 

So even in the lifetime of the mathematician who named the concept, the 
original intuition behind the concept was somewhat lost. And Frechet's intuitive 
nested intersection definition was supplanted by less intuitive but more powerful 
notions of limit-point, sequential, and — as we will examine next — open-cover 
compactness. 

3.2 Alexandroff and Urysohn: Open-cover compactness 

While Frechet was the first to use the term compactness for the Bolzano- 
Weierstrass property, his contemporaries in Russia, Alexandroff and Urysohn, 
are credited with defining a related term, orginally called bicompactness, based 
on the Borel-Lebesgue property, which has come to be the modern notion of 
compactness [U p. 425S]. Alexandroff and Urysohn were actually in close con- 
tact with Frechet [45l pp. 319-357] Q 

At the time that Alexandroff and Urysohn introduced their definition based 
on the Borel-Lebesgue property, they actually thought that the Bolzano- Weierstrass 
property was a more important concept. It took some time before it became 
clear that the Borel-Lebesgue property was more important. Probably what 
tilted the balance was Tychonoff's theorem, proved first for a special case of 
closed unit intervals in 1930 and in general in 1935, which stated that an in- 
finite product of compact sets is compact [THj. Tychonoff's theorem holds for 
open-cover compactness, but not for sequential compactness. 

Also, though Alexandroff and Urysohn usually get credit for defining open- 
cover compactness, Frechet was not unaware of the possibility of using neigh- 
borhoods to characterize compactness. In a correspondence in 1905, Hadamard 
suggested that Frechet think in terms of neighborhoods to generalize the prop- 
erties of the real line to abstract topological spaces. And the first definition that 
Frechet gave, in terms of nested intersections, is the dual of (and hence logically 
equivalent to) countable open-cover compactness. 

^Urysohn died in a swimming accident off the coast of France at the age of 26. Much of 
his work was published posthumously by Alexandroff, who kept up his correspondence with 
Frechet after Urysohn's untimely death. 
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3.3 Open-cover vs. limit-point compactness 

Though Frechet may have been motivated originally to define compactness for 
abstract topological spaces, we have seen that it was the open-cover property 
that eventually took priority over both limit-point compactness and the related 
notion of sequential compactnes^ While these notions are equivalent on metric 
spaces, they are not equivalent on abstract topological spaces. Here we will look 
at one space in which the notions are not equivalent. 

Consider Sq = {a | a is an ordinal number and a < Q} with the order 
topology, where is the first uncountable ordinal number. (See diagram below. 
The first infinite ordinal, w, is the first ordinal after "exhausting" the natural 
numbers. The first uncountable ordinal, fl, is the ordinal after "exhausting" the 
countable ordinals.) 



1 2 



Figure 6: Representation of Sq. 

We know that all closed subsets of compact sets are compact (and vice versa). 
So Sq is not compact since it is not closed in the compact set Sn U {fi}. 

However, it turns out that Sq is limit-point compact. To see why this is 
true, we will use the fact that any countable subset of 5*0 has an upper bound 
in Sq [3SJ p. 67]. If we take any infinite subset of Sq, it has a countably infinite 
subset, which we will call X. Since X is countable, it has an upper bound, let's 
call it b, in Sq. 



t 

b 



n 



Figure 7: Illustration of 5 in Sq. 

But the interval [l,b] is compact since Sq has the l.u.b. property. So there 
must be a point in [1, b] which is a limit point (of both X and any set containing 
it). Thus, Sq is limit-point compact. Essentially the same argument shows 

^Unless otherwise stated, we will be dealing with metric spaces on which limit-point and 
sequential compactness are equivalent. (An infinite subset has a limit point iff there is a 
subsequence converging to that limit.) 
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that any sequence in Sn must have a convergent subsequence in Sn, so Sn is 
sequentially compact. 

We see then that limit-point/sequential compactness and open-cover com- 
pactness do not agree on abstract topological spaces. However, it turns out 
that there is a way to generalize the notion of sequence, using nets and filters 
to formulate concepts similar to sequential compactness that do agree. We will 
define these concepts in the next section. 

3.4 Side note: Compactness and finiteness 

Before we do so, we take a short digression to point out the connection between 
compactness and finiteness, which may be obvious to mathematicians but not 
so obvious to studentiF"! 

Both sequential and open-cover notions of compactness allow us to see com- 
pactness as a generalization of finiteness. From the perspective of sequences: if 
we take any infinite sequence of elements of a finite set, then one element will 
repeat infinitely often. In other words, there is a subsequence of that sequence 
that converges in the most trivial sense; each element has the same value. Sets 
that are sequentially compact have a similar property. Given any sequence, 
we can find a subsequence, not that has exactly the same value, but one that 
converges to some limit. Intuitively, if one squints, then a convergent sequence 
looks constant. 

The open-cover definition can also be seen as a generalization of finiteness. 
While it is not true that all compact sets have a finite number of elements, it is 
true that every compact set can be covered with finitely many small neighbor- 
hoods. If one imagines a compact set in in which open sets are little balls, 
then we can envision a covering of the set by balls which are metaphorically like 
enlarged points. If you squint, those balls can look like points and the compact 
set like a finite set. As Herman Weyl joked: A compact city is one that can be 
guarded by finitely many arbitrarily nearsighted policemen |25l p. 499]! 

4 Sequential compactness in abstract topologi- 
cal spaces 

In the previous section we saw that the two important properties of compactness, 
those stemming from the Bolzano- Weierstrass property and the Borel-Lebesgue 
property, are not equivalent in abstract topological spaces. In this section we 
will show how the former can be generalized to restore that equivalence. We 

^"There are several good references for discussions of the relationship between compactness 
and finiteness. For example, Hewitt gives examples to support his claim that many properties 
in analysis are trivial for finite sets, true and reasonable for compact sets, and either false 
or ridiculously hard to prove for non-compact sets. There is also a nice discussion in 1201 on 
the problem of finding eigenvalues of infinite dimensional matrices. It turns out compactness 
is what takes the place of boundedness for finite spaces. Further, several textbook authors, 
e.g. | 11| . use finiteness as a motivation for compactness. 
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will introduce definitions and theorems about nets and filters, both of which are 
generalizations of sequences which allow us to restate sequential compactness in 
a way that is equivalent to open-cover compactness. 

4.1 Moore and Smith: Nets 

The theory of nets was developed by E. H. Moore and his student H. L. Smitlj^ 
in 1922. As I mentioned in the introduction, it is unclear whether Moore and 
Smith knew how nets could be used to define compactness. This connection is 
usually credited to Birkhoff [281 P- 64], but Moore and Smith did generalize 
some of Frechet's compactness results in the same paper in which they defined 
nets [531 P- 118]. One of the motivations for Moore and Smith's theory was "to 
be able to minimize integrals of the form 

ay 

for a given class of functions 2/(2;)" [Ml P- 113]. Our goal here, though, is to 
express compactness in terms of nets, so we will use the example to motivate 
and illustrate net compactness. 

The problem in the S'n example is that while £7 is a limit point of Su (any 
neighborhood of f2 contains points of Sn), no sequence in Sn converges to j28| 
p. 76]. If we are limited to taking a countable number of elements in the 
sequence, we will never reach f2. Nets provide one way of getting around this 
problem by allowing us to have something like uncountable sequencesj^ In our 
discussion of both nets and filters, we will consider only topological spaces, on 
which the notion of neighborhood is defined. 

To see how nets are a generalization of sequences, it is useful to think of 
sequences as functions on the natural numbers. 

Definition 4.1 A sequence, (denoted {xn}neN = {xi,X2,X3, ...}) is a function 
which assigns to each element n of the natural numbers, N , a functional value 
Xn in a set X . 

We would like to replace N with a set that can be uncountable but has an 
ordering similar to that of N. In other words, we want to stipulate conditions for 
an ordering relation on a generic set that generalizes the way > orders natural 
numbers. We will call this relation >- to suggest the connection to >, and we 
will say that this relation "directs" a given set. 

Definition 4.2 A non-empty set D, with the relation >- is called directed iff 
(i) if di, d2, d^ G D such that di >- d2 and ^2 d^ then di >- d^; 
(a) if di,d2 £ D, then there is a d^ £ D such that dj, >- di and d^ d2. 

^^Little biographical information about Smith is available. I know that he received his 
Ph.D. from University of Chicago under Moore and got a job at Louisiana State University. 
But apparently after his important work on nets and filters, he dropped into obscurity |22l p. 
563] 

l^This treatment follows 28, pp. 62-70] and 32, pp. 281-283, 286-289]. See also [6]. 
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So the definition for net is simply the definition for sequence with N replaced 
by the notion of a directed set. From now on, D wiU stand for a directed set 
with the relation >- as defined above. 

Definition 4.3 A net (denoted {a^djdeD or simply {xd}) is a function which 
assigns to each element d of a directed set D a functional value Xd in a set X . 

As a simple example of a net, take a non-empty totally ordered directed set. 
Every function on such a set is a net. 

Once we know what a net is, we can state what it means for it to converge. 
Again we can derive the definition for net convergence and limit point by taking 
the definitions involving sequences and simply replacing N and > with D and 

y. 

Definition 4.4 A net {xd} converges to a X (denoted {xd} a) iff for 
every neighborhood U of a, there is an index do (z D such that if d >- do then 
Xd U (i.e. if the net is eventually in each neighborhood of a). 

Definition 4.5 A point a is a limit point of {xd} if every neighborhood of a 
contains at least one element of the net. 

In order to state compactness in terms of nets, we also need the concept 
of subnet, the analog of subsequence]^ Part of the definition of subsequence 
generalizes easily, but the other part requires us to think about subsequences in 
a slightly different way than we are accustomed. The first defining property of 
subsequence is that each element of the subsequence can be identified with an 
element of the sequence. This property is generalized in (i) below. The second 
defining property requires that the subsequence is ordered in a similar way as the 
sequence. Usually we require the indices of the subsequence, like the indices of 
the sequence, to be strictly increasing. In other words, for a subsequence {xn^} 
of a sequence {xn}, the are positive integers such that ni < n2 < ■ ■ ■. But 
the feature of this condition which turns out to be important is simply the fact 
that as fc — >■ oo, so do the Uk. This property is generalized in (ii) below. 

Definition 4.6 A subnet of a net {xd}deD is a net {yb}beB where B is a 
directed set and there is a function if : B ^ D such that: 

(i) yb = x^{b) and 

(ii) Vd G D,3foo G B such that if b )~ bo then ip{b) >- d. 

We now are ready to characterize compactness in terms of nets. 

Theorem 4.7 A topological space X is compact iff . . . 

• Every net of points of X has a limit point in X. 

• Every net of points of X has a convergent subnet in X. 

^•^Incidentally, KcUcy, who first coined the term "net," had considered using the term "way" 
so the analog of subsequence would be "subway." McShane also proposed the term "stream" 
for net since he thought it was intuitive to think of the relation of the directed set as "being 
downstream from" [321 p. 282]. 
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Notice that these definitions are precisely the same as hmit point and se- 
quential compactness for metric spaces with the term "net" substituted for 
"sequence." 

Applying these definitions to the Sq example, we can show why Sq is not 
compact. If we take a net Xd of elements of Sq, it is no longer the case that 
there will necessarily be a limit in Sq. In particular, let D = Sq and Xd = d. 
Then Xd converges to 17, which is not in 5*^. Thus no subnet of Xd will converge 
to a point in Sq. 

4.2 Cartan (and Smith): Filters 

Nets are not the only way of generalizing sequences. Another generalization of 
sequence is filter, a notion suggested by Cartan in 1937. These notions are essen- 
tially the same (see [551 P- 83] and [Q), save for a a subtle distinction involving 
a particular type of limit found in the advanced theory of integration [131 p. 
37l|3 

Again, we will define the notions we need to state compactness in terms 
of filters and then apply our compactness result to show Sq is not compact. 
As with nets, we can look at convergence of sequences to motivate the idea of 
convergence of filters. But whereas with nets, the focus was on the index set, 
with filters the focus is on neighborhoods. 

Definition 4.8 Let X he a set. A set (f> of subsets of X is called a filter iff 

(a) Ai a A2 c X and Ai e ^ ^ A2 e ^ 
(Hi) Ai, A2 e ^ ^ Ai n A2 e ^ 

(Dixmier, p. 13). 

An example of a filter is called the co-finite filter: Let X be an infinite set. 
F = {U CX : X\\] is finite}. 

As with nets, we should define what it means for a filter to converge: 

Definition 4.9 A filter $ converges to a ^ A ( denoted $ — > a j iff each neigh- 
borhood of a is a member of $ . 

There is a natural way to associate a filter with any sequence. If xi , 2:2, 2^3, • ■ • 
is a sequence in X, we can associate with this sequence a filter $ on A" such 
that Va e A, {a;„} a iff $ a. In particular, let $ = {A C A' | Bfc^ such 
that \/i > kA,Xi £ A\. 



There is a lot of interesting history and folklore surrounding nets and filters. It turns out 
that nets are more popular in the U.S. and filters in Europe 1461 p. 114], One might think 
that this division comes from the fact that Moore and Smith were American and Cartan was 
French 1281 p. 83]. However, it turns out that Smith actually independently discovered filters 
as an attempt to explain what was lacking in the theory of nets that he and Moore proposed. 
Moreover, the idea behind filters was actually foreshadowed by Riesz in 1909 when he provided 
axioms for topology based on limit points instead of metrics. Riesz defines a concept called 
an "ideal" which is essentially the same as what we now call an ultrafilter _46, p. 111]. 
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Example with /f/\ = 5 



Figure 8: Member of filter with kA — 5. 

So the tails of the sequence are contained in neighborhoods which are mem- 
bers of the filter. The condition that each neighborhood of a is in the filter is 
then equivalent to the condition that the sequence is eventually in any neigh- 
borhood of a. 

In order to define compactness in terms of filters, we need one more notion, 
that of an ultrafiltcr. 

Definition 4.10 A filter in X is an ultrafilter iff no filter in X properly con- 
tains it \15\. p. 4^]- 

The notion of ultrafilter is not exactly analogous to subsequence, but in the 
formulation of compactness, it serves the same purpose. 

Theorem 4.11 A topological space X is compact iff every ultrafilter on X 
converges in X Vol. 2, C, p. 238]. 

Now we can return again to our example and get a sense in terms of filters 
for why Sq_ is not compact]^ We want to show that there is an ultrafilter on 
Sa that does not converge. Consider all the neighborhoods of SI in Sn U Vl. Let 
= {A C 5o I 3a e So. such that V/3 > a, /3 £ A). 

This clearly satisfies the definition for a filter. Let \1/ be any ultrafilter 
containing $. We claim ^ does not converge in Sq,. Suppose it did. Say that 
^! b. Now pick some a > &, a G Sn- Then = {/3 : /? > a} e * since 

^^The proof of this claim, in particular when we assert that there is an ultrafilter containing 
our filter, actually relies on the axiom of choice. 
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a 



Figure 9: Member of filter on Sn- 

C $ C We also have = {/3 : /3 < a} e * since is an open 
neighborhood of b (and we claim 5" converges to b). 

^ A- ► t ► 



b 



Figure 10: Illustration of A and A~^ . 

But A~^ nA~ = 0, which violates the definition of a filter, so our assumption 
must be wrong. Thus, ^' must not converge, and hence Sn is not compact. 

5 Conclusion 

This paper has traced the history of compactness from the original motivating 
questions, through the process of being defined, to more contemporary work that 
restores an equivalence between the two main competing characterizations, the 
Bolzano- Weierstrass property using sequences and the Borel-Lesbesgue property 
using open covers. These concepts are not the same- there are some spaces that 
are compact which are not sequentially compact, and visa versa. However, the 
two concepts agree on metric spaces, and the former can be modified, using nets 
and filters, to be equivalent to the latter. 

We have seen how the notion of compactness evolved from a more intuitive 
notion to a less intuitive one. Given the difficulty and subtlety of the open-cover 
characterization of compactness, it might be useful when teaching compactness 
to either connect back to the more intuitive notions, or actually develop the 
material more along the historical lines discussed here. 

Finally, the idea for compactness grew out of a rather simple, but ultimately 
deep question about the real numbers. The fact that Bolzano (and later Cauchy 
and Weierstrass) saw the need to answer questions about the Intermediate Value 
Property and Maximum Value Property of continuous functions marks a major 
innovation in mathematical thought. The opportunity to expose students to 
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these ideas, which arc within reach of an advanced undergraduate student, seems 
too precious to miss. I hope this paper provides some information, or at least 
some starting points, for doing so. 
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